Practice questions

Answer the following questions:

  1. What is a key difference between area data and point data?
  2. What is a choropleth map?
  3. What is a cartogram?
  4. What are the advantages and disadvantages of these mapping techniques?

Learning objectives

In this activity, you will:

  1. Use the concept of quadrats to analyze a real dataset.
  2. Learn about a quadrat-based test for randomness in point patterns.
  3. Learn how to use the p-value of a statistical test to make a decision.
  4. Think about the distribution of events in a null landscape.
  5. Think about ways to decide whether a landscape is random.

Suggested reading

O’Sullivan D and Unwin D (2010) Geographic Information Analysis, 2nd Edition, Chapter 7. John Wiley & Sons: New Jersey.

Preliminaries

For this activity you will need the following:

It is good practice to clear the working space to make sure that you do not have extraneous items there when you begin your work. The command in R to clear the workspace is rm (for “remove”), followed by a list of items to be removed. To clear the workspace from all objects, do the following:

rm(list = ls())

Note that ls() lists all objects currently on the worspace.

Load the libraries you will use in this activity. In addition to tidyverse, you will need spatstat, a package designed for the analysis of point patterns (you can learn about spatstat here and here):

library(tidyverse)
package <U+393C><U+3E31>tidyverse<U+393C><U+3E32> was built under R version 3.4.3-- Attaching packages --------------------------------------------- tidyverse 1.2.1 --
v ggplot2 2.2.1     v purrr   0.2.4
v tibble  1.4.1     v dplyr   0.7.4
v tidyr   0.7.2     v stringr 1.2.0
v readr   1.1.1     v forcats 0.2.0
package <U+393C><U+3E31>ggplot2<U+393C><U+3E32> was built under R version 3.4.2package <U+393C><U+3E31>tibble<U+393C><U+3E32> was built under R version 3.4.3package <U+393C><U+3E31>tidyr<U+393C><U+3E32> was built under R version 3.4.3package <U+393C><U+3E31>purrr<U+393C><U+3E32> was built under R version 3.4.3package <U+393C><U+3E31>dplyr<U+393C><U+3E32> was built under R version 3.4.3package <U+393C><U+3E31>forcats<U+393C><U+3E32> was built under R version 3.4.3-- Conflicts ------------------------------------------------ tidyverse_conflicts() --
x dplyr::filter() masks stats::filter()
x dplyr::lag()    masks stats::lag()
library(rgdal)
Loading required package: sp
rgdal: version: 1.2-11, (SVN revision 676)
 Geospatial Data Abstraction Library extensions to R successfully loaded
 Loaded GDAL runtime: GDAL 2.2.0, released 2017/04/28
 Path to GDAL shared files: C:/Users/Antonio/Documents/R/win-library/3.4/rgdal/gdal
 Loaded PROJ.4 runtime: Rel. 4.9.3, 15 August 2016, [PJ_VERSION: 493]
 Path to PROJ.4 shared files: C:/Users/Antonio/Documents/R/win-library/3.4/rgdal/proj
 Linking to sp version: 1.2-5 
library(broom)
library(cartogram)
package <U+393C><U+3E31>cartogram<U+393C><U+3E32> was built under R version 3.4.3
library(plotly)
package <U+393C><U+3E31>plotly<U+393C><U+3E32> was built under R version 3.4.3
Attaching package: <U+393C><U+3E31>plotly<U+393C><U+3E32>

The following object is masked from <U+393C><U+3E31>package:ggplot2<U+393C><U+3E32>:

    last_plot

The following object is masked from <U+393C><U+3E31>package:stats<U+393C><U+3E32>:

    filter

The following object is masked from <U+393C><U+3E31>package:graphics<U+393C><U+3E32>:

    layout

In the practice that preceded this activity, you learned about the area data and visualization techniques for area data.

Begin by loading the data that you will use in this activity:

Hamilton_CT <- readOGR(".", layer = "Hamilton CMA CT", integer64 = "allow.loss")
OGR data source with driver: ESRI Shapefile 
Source: ".", layer: "Hamilton CMA CT"
with 188 features
It has 255 fields
Integer64 fields read as signed 32-bit integers:  ID POPULATION PRIVATE_DW OCCUPIED_D ALL_AGES AGE_4 AGE_5_TO_9 AGE_10_TO_ AGE_15_TO_ AGE_15 AGE_16 AGE_17 AGE_18 AGE_19 AGE_20_TO_ AGE_25_TO_ AGE_30_TO_ AGE_35_TO_ AGE_40_TO_ AGE_45_TO_ AGE_50_TO_ AGE_55_TO_ AGE_60_TO_ AGE_65_TO_ AGE_70_TO_ AGE_75_TO_ AGE_80_TO_ AGE_85 MEDIAN_AGE MALE_ALL_A MALE_4 MALE_5_TO_ MALE_10_TO MALE_15_TO MALE_15 MALE_16 MALE_17 MALE_18 MALE_19 MALE_20_TO MALE_25_TO MALE_30_TO MALE_35_TO MALE_40_TO MALE_45_TO MALE_50_TO MALE_55_TO MALE_60_TO MALE_65_TO MALE_70_TO MALE_75_TO MALE_80_TO MALE_85 MALE_MEDIA FEMALE_ALL FEMALE_4 FEMALE_5_T FEMALE_10_ FEMALE_15_ FEMALE_15 FEMALE_16 FEMALE_17 FEMALE_18 FEMALE_19 FEMALE_20_ FEMALE_25_ FEMALE_30_ FEMALE_35_ FEMALE_40_ FEMALE_45_ FEMALE_50_ FEMALE_55_ FEMALE_60_ FEMALE_65_ FEMALE_70_ FEMALE_75_ FEMALE_80_ FEMALE_85 FEMALE_MED MARRIED_AG MARRIED_OR MARRIED COMMON_LAW UNMARRIED SINGLE SEPARATED DIVORCED WIDOWED MARRIED_A1 MARRIED_O1 MARRIED_M COMMON_LA1 UNMARRIED_ SINGLE_M SEPARATED_ DIVORCED_M WIDOWED_M MARRIED_A2 MARRIED_O2 MARRIED_F COMMON_LA2 UNMARRIED1 SINGLE_F SEPARATED1 DIVORCED_F WIDOWED_F FAMILIES_I FAMILY_SIZ FAMILY_SI1 FAMILY_SI2 FAMILY_SI3 COUPLE_FAM COUPLE_MAR COUPLE_MA1 COUPLE_MA2 COUPLE_MA3 COUPLE_MA4 COUPLE_MA5 COUPLE_COM COUPLE_CO1 COUPLE_CO2 COUPLE_CO3 COUPLE_CO4 COUPLE_CO5 SINGLE_PAR SINGLE_PA1 SINGLE_PA2 SINGLE_PA3 SINGLE_PA4 SINGLE_PA5 SINGLE_PA6 SINGLE_PA7 SINGLE_PA8 CHILDREN_F CHILDREN_1 CHILDREN_2 CHILDREN_3 CHILDREN_4 CHILDREN_5 POPULATIO1 POPULATIO2 POPULATIO3 POPULATIO4 POPULATIO5 POPULATIO6 POPULATIO7 POPULATIO8 POPULATIO9 POPULATI10 POPULATI11 POPULATI12 PRIVATE_HO PRIVATE_HH PRIVATE_H1 PRIVATE_H2 PRIVATE_H3 PRIVATE_H4 PRIVATE_H5 PRIVATE_H6 PRIVATE_H7 PRIVATE_H8 PRIVATE_H9 PRIVATE_10 PRIVATE_11 PRIVATE_12 PRIVATE_13 PRIVATE_14 PRIVATE_15 OCC_PRIVAT OCC_PRIVA1 OCC_PRIVA2 OCC_PRIVA3 OCC_PRIVA4 OCC_PRIVA5 OCC_PRIVA6 OCC_PRIVA7 OCC_PRIVA8 OCC_PRIVA9 PRIVATE_16 PRIVATE_17 PRIVATE_18 PRIVATE_19 PRIVATE_20 PRIVATE_21 PRIVATE_22 PRIVATE_23 NATIVE_LAN NATIVE_LA1 NATIVE_LA2 NATIVE_LA3 NATIVE_LA4 NATIVE_LA5 NATIVE_LA6 NATIVE_LA7 NATIVE_LA8 NATIVE_LA9 NATIVE_L10 NATIVE_L11 NATIVE_L12 NATIVE_L13 NATIVE_L14 NATIVE_L15 NATIVE_L16 NATIVE_L17 NATIVE_L18 NATIVE_L19 NATIVE_L20 NATIVE_L21 NATIVE_L22 NATIVE_L23 NATIVE_L24 NATIVE_L25 NATIVE_L26 NATIVE_L27 NATIVE_L28 NATIVE_L29 NATIVE_L30 NATIVE_L31 NATIVE_L32 NATIVE_L33 NATIVE_L34 NATIVE_L35 NATIVE_L36 NATIVE_L37 NATIVE_L38 NATIVE_L39 NATIVE_L40 NATIVE_L41 NATIVE_L42 NATIVE_L43 NATIVE_L44 NATIVE_L45 NATIVE_L46 NATIVE_L47 NATIVE_L48 NATIVE_L49 NATIVE_L50 NATIVE_L51 NATIVE_L52 NATIVE_L53 NATIVE_L54 NATIVE_L55 NATIVE_L56 NATIVE_L57 NATIVE_L58 NATIVE_L59 

You can obtain new (calculated) variables as follows. For instance, to obtain the proportion of residents who are between 20 and 34 years old, and between 35 and 49:

Hamilton_CT@data <- dplyr::transmute(Hamilton_CT@data, AREA = AREA, TRACT = TRACT,
                                     POPULATION = POPULATION,
                                     POP20to34 = (AGE_20_TO_ + AGE_25_TO_ + AGE_30_TO_),
                                     Prop20to34 = POP20to34/POPULATION, 
                                     POP35to49 = (AGE_35_TO_ + AGE_40_TO_ + AGE_45_TO_),
                                     Prop35to49 = POP35to49/POPULATION, 
                                     POP50to64 = (AGE_50_TO_ + AGE_55_TO_ + AGE_60_TO_),
                                     Prop50to64 = POP50to64/POPULATION,
                                     POP65Plus = (AGE_65_TO_ + AGE_70_TO_ + AGE_75_TO_ + AGE_80_TO_ + AGE_85),
                                     Prop65Plus = POP65Plus/POPULATION)

This is a SpatialPolygonDataFrame. Convert to a dataframe (“tidy” it) for plotting using ggplot2:

Hamilton_CT.t <- tidy(Hamilton_CT, region = "TRACT")
Hamilton_CT.t <- rename(Hamilton_CT.t, TRACT = id)

Rejoin the data:

Hamilton_CT.t <- left_join(Hamilton_CT.t, Hamilton_CT@data, by = "TRACT")
Column `TRACT` joining character vector and factor, coercing into character vector

Activity

  1. Create choropleth maps for the proportion of the population who are 20 to 34 years old, 35 to 49 years old, 50 to 65 years old, and 65 and older.
ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop20to34, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 20 to 34")

ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop35to49, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 35 to 49")

ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop50to64, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 50 to 64")

ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop65Plus, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 65+")

  1. Show your maps to a fellow student. What patterns do you notice in the distribution of age in Hamilton?

  2. Devise a rule to decide whether the pattern observed in a choropleth map is random.

Bonus

Create cartograms of the same variables above. Since the cartograms are created based on the SpatialPolygonsDataFrame:

Prop20to34.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 1: 5.94063097656517
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 2: 4.22282836609772
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 3: 3.24121918122005
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 4: 2.70487592121738
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 5: 2.44050500280525
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 6: 2.28238579460187
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 7: 2.14591026585647
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 8: 1.95255176465899
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 9: 1.8186943555214
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 10: 1.73775203417466
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 11: 1.64845060853967
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 12: 1.45162213155545
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 13: 1.37178691507706
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 14: 1.32738198642848
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 15: 1.29727221750122
Prop35to49.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 1: 5.94063097656517
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 2: 4.22282836609772
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 3: 3.24121918122005
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 4: 2.70487592121738
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 5: 2.44050500280525
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 6: 2.28238579460187
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 7: 2.14591026585647
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 8: 1.95255176465899
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 9: 1.8186943555214
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 10: 1.73775203417466
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 11: 1.64845060853967
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 12: 1.45162213155545
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 13: 1.37178691507706
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 14: 1.32738198642848
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 15: 1.29727221750122
Prop50to64.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 1: 5.94063097656517
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 2: 4.22282836609772
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 3: 3.24121918122005
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 4: 2.70487592121738
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 5: 2.44050500280525
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 6: 2.28238579460187
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 7: 2.14591026585647
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 8: 1.95255176465899
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 9: 1.8186943555214
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 10: 1.73775203417466
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 11: 1.64845060853967
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 12: 1.45162213155545
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 13: 1.37178691507706
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 14: 1.32738198642848
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 15: 1.29727221750122
Prop65Plus.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 1: 5.94063097656517
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 2: 4.22282836609772
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 3: 3.24121918122005
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 4: 2.70487592121738
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 5: 2.44050500280525
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 6: 2.28238579460187
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 7: 2.14591026585647
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 8: 1.95255176465899
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 9: 1.8186943555214
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 10: 1.73775203417466
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 11: 1.64845060853967
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 12: 1.45162213155545
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 13: 1.37178691507706
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 14: 1.32738198642848
Spatial object is not projected; GEOS expects planar coordinatesSpatial object is not projected; GEOS expects planar coordinatesMean size error for iteration 15: 1.29727221750122

Tidy and restore the data:

Prop20to34.cartogram.t <- tidy(Prop20to34.cartogram, region = "TRACT")
Prop20to34.cartogram.t <- rename(Prop20to34.cartogram.t, TRACT = id)
Prop20to34.cartogram.t <- left_join(Prop20to34.cartogram.t, select(Prop20to34.cartogram@data, TRACT, Prop20to34), by = "TRACT")
Column `TRACT` joining character vector and factor, coercing into character vector
Prop20to34.cartogram.t <- rename(Prop20to34.cartogram.t, Proportion = Prop20to34)
Prop35to49.cartogram.t <- tidy(Prop35to49.cartogram, region = "TRACT")
Prop35to49.cartogram.t <- rename(Prop35to49.cartogram.t, TRACT = id)
Prop35to49.cartogram.t <- left_join(Prop35to49.cartogram.t, select(Prop35to49.cartogram@data, TRACT, Prop35to49), by = "TRACT")
Column `TRACT` joining character vector and factor, coercing into character vector
Prop35to49.cartogram.t <- rename(Prop35to49.cartogram.t, Proportion = Prop35to49)
Prop50to64.cartogram.t <- tidy(Prop50to64.cartogram, region = "TRACT")
Prop50to64.cartogram.t <- rename(Prop50to64.cartogram.t, TRACT = id)
Prop50to64.cartogram.t <- left_join(Prop50to64.cartogram.t, select(Prop50to64.cartogram@data, TRACT, Prop50to64), by = "TRACT")
Column `TRACT` joining character vector and factor, coercing into character vector
Prop50to64.cartogram.t <- rename(Prop50to64.cartogram.t, Proportion = Prop50to64)
Prop65Plus.cartogram.t <- tidy(Prop65Plus.cartogram, region = "TRACT")
Prop65Plus.cartogram.t <- rename(Prop65Plus.cartogram.t, TRACT = id)
Prop65Plus.cartogram.t <- left_join(Prop65Plus.cartogram.t, select(Prop65Plus.cartogram@data, TRACT, Prop65Plus), by = "TRACT")
Column `TRACT` joining character vector and factor, coercing into character vector
Prop65Plus.cartogram.t <- rename(Prop65Plus.cartogram.t, Proportion = Prop65Plus)

Stack all cartograms into a single dataframe, add factor for frames:

Cartograms <- rbind(data.frame(Prop20to34.cartogram.t, factor = "Age 20 to 34"),
                    data.frame(Prop35to49.cartogram.t, factor = "Age 35 to 49"),
                    data.frame(Prop50to64.cartogram.t, factor = "Age 50 to 64"),
                    data.frame(Prop65Plus.cartogram.t, factor = "Age 65+"))

Create plot with frames:

Cartograms.plot <- ggplot(Cartograms, aes(x= long, y = lat, group = group, fill = cut_number(Proportion, 5))) + 
  geom_polygon(aes(frame = factor), color = "white") +
  scale_fill_brewer(palette = "YlOrRd") +
  coord_fixed() +
  theme(legend.position = "bottom") +
  labs(fill = "Proportion")
Ignoring unknown aesthetics: frame

Plot:

ggplotly(Cartograms.plot)
We recommend that you use the dev version of ggplot2 with `ggplotly()`
Install it with: `devtools::install_github('hadley/ggplot2')`

Try rectangular cartograms:

library(recmap)
package <U+393C><U+3E31>recmap<U+393C><U+3E32> was built under R version 3.4.3Loading required package: GA
package <U+393C><U+3E31>GA<U+393C><U+3E32> was built under R version 3.4.3Loading required package: foreach
foreach: simple, scalable parallel programming from Revolution Analytics
Use Revolution R for scalability, fault tolerance and more.
http://www.revolutionanalytics.com

Attaching package: <U+393C><U+3E31>foreach<U+393C><U+3E32>

The following objects are masked from <U+393C><U+3E31>package:purrr<U+393C><U+3E32>:

    accumulate, when

Loading required package: iterators
  ____    _    
 / ___|  / \     Genetic 
| |  _  / _ \    Algorithms
| |_| |/ ___ \   
 \____/_/   \_\  version 3.0.2
Type 'citation("GA")' for citing this R package in publications.
Loading required package: Rcpp
Package 'recmap' version 0.5.24

Rectangular algorithms require the centroids of the zones, which can be obtained by means of sp::coordinates

xy_centroids <- coordinates(Hamilton_CT)

Create a frame for the rectangular cartograms:

rec_frame <- data.frame(x = xy_centroids[,1], 
           y = xy_centroids[,2],
           dx = sqrt(Hamilton_CT$AREA) / 2 / (0.7 * 60 * cos(xy_centroids[,2] * pi / 180)),
           dy = sqrt(Hamilton_CT$AREA) / 2 / (0.7 * 60)) 

Generate the input for the recmap function. This is the frame for the cartograms rec_frame and a variable z:

POP.rec <- data.frame(rec_frame, z = Hamilton_CT$POPULATION + 1, name = Hamilton_CT$TRACT)
head(POP.rec)
POP.rec.cartogram <- recmap(POP.rec)
5370140.04 could not be placed on the first attempt;5370140.03 could not be placed on the first attempt;5370084.04 could not be placed on the first attempt;5370086.00 could not be placed on the first attempt;5370301.00 could not be placed on the first attempt;5370303.01 could not be placed on the first attempt;5370303.02 could not be placed on the first attempt;5370302.00 could not be placed on the first attempt;5370300.00 could not be placed on the first attempt;5370085.03 could not be placed on the first attempt;5370085.02 could not be placed on the first attempt;5370085.01 could not be placed on the first attempt;5370084.05 could not be placed on the first attempt;5370084.03 could not be placed on the first attempt;5370084.02 could not be placed on the first attempt;5370084.01 could not be placed on the first attempt;5370072.04 could not be placed on the first attempt;
plot(POP.rec.cartogram)

ggplot(data = POP.rec.cartogram) + geom_rect(aes(xmin = x - dx, xmax = x + dx, ymin = y - dy, ymax = y + dy)) + coord_fixed()

Cartograms.rec <- rbind(data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop20to34,
                                   factor = "Age 20 to 34"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop35to49,
                                   factor = "Age 35 to 49"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop50to64,
                                   factor = "Age 50 to 64"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop65Plus,
                                   factor = "Age 65+"))

Create plot with frames:

Cartograms.rec.plot <- ggplot(Cartograms.rec, aes(xmin = x - dx, xmax = x + dx, ymin = y - dy, ymax = y + dy, fill = cut_number(Proportion, 5))) + 
  geom_rect(aes(frame = factor)) +
  scale_fill_brewer(palette = "YlOrRd") +
  coord_fixed() +
  theme(legend.position = "bottom") +
  labs(fill = "Proportion")
Ignoring unknown aesthetics: frame

Plot:

plot <- Cartograms.rec.plot %>% animation_opts(frame = 1000, transition = 1000, easing = "elastic")
ggplotly(plot)
We recommend that you use the dev version of ggplot2 with `ggplotly()`
Install it with: `devtools::install_github('hadley/ggplot2')`
---
title: "04 Session 4: Point Pattern Analysis I"
output: html_notebook
---

#Practice questions

Answer the following questions:

1. What is a key difference between area data and point data?
2. What is a choropleth map?
3. What is a cartogram?
4. What are the advantages and disadvantages of these mapping techniques?

#Learning objectives

In this activity, you will:

1. Use the concept of quadrats to analyze a real dataset.
2. Learn about a quadrat-based test for randomness in point patterns.
3. Learn how to use the p-value of a statistical test to make a decision.
4. Think about the distribution of events in a null landscape.
5. Think about ways to decide whether a landscape is random.

#Suggested reading

O'Sullivan D and Unwin D (2010) Geographic Information Analysis, 2nd Edition, Chapter 7. John Wiley & Sons: New Jersey.

#Preliminaries

For this activity you will need the following:

* This R markdown notebook.
* A dataset called `Toronto Business Points.RData`.

It is good practice to clear the working space to make sure that you do not have extraneous items there when you begin your work. The command in R to clear the workspace is `rm` (for "remove"), followed by a list of items to be removed. To clear the workspace from _all_ objects, do the following:
```{r}
rm(list = ls())
```

Note that `ls()` lists all objects currently on the worspace.

Load the libraries you will use in this activity. In addition to `tidyverse`, you will need `spatstat`, a package designed for the analysis of point patterns (you can learn about `spatstat` [here](https://cran.r-project.org/web/packages/spatstat/vignettes/getstart.pdf) and [here](http://spatstat.org/resources/spatstatJSSpaper.pdf)):
```{r}
library(tidyverse)
library(rgdal)
library(broom)
library(cartogram)
library(plotly)
```

In the practice that preceded this activity, you learned about the area data and visualization techniques for area data.

Begin by loading the data that you will use in this activity:
```{r}
Hamilton_CT <- readOGR(".", layer = "Hamilton CMA CT", integer64 = "allow.loss")
```

You can obtain new (calculated) variables as follows. For instance, to obtain the proportion of residents who are between 20 and 34 years old, and between 35 and 49:
```{r}
Hamilton_CT@data <- dplyr::transmute(Hamilton_CT@data, AREA = AREA, TRACT = TRACT,
                                     POPULATION = POPULATION,
                                     POP20to34 = (AGE_20_TO_ + AGE_25_TO_ + AGE_30_TO_),
                                     Prop20to34 = POP20to34/POPULATION, 
                                     POP35to49 = (AGE_35_TO_ + AGE_40_TO_ + AGE_45_TO_),
                                     Prop35to49 = POP35to49/POPULATION, 
                                     POP50to64 = (AGE_50_TO_ + AGE_55_TO_ + AGE_60_TO_),
                                     Prop50to64 = POP50to64/POPULATION,
                                     POP65Plus = (AGE_65_TO_ + AGE_70_TO_ + AGE_75_TO_ + AGE_80_TO_ + AGE_85),
                                     Prop65Plus = POP65Plus/POPULATION)
```

This is a `SpatialPolygonDataFrame`. Convert to a dataframe ("tidy" it) for plotting using `ggplot2`:
```{r}
Hamilton_CT.t <- tidy(Hamilton_CT, region = "TRACT")
Hamilton_CT.t <- rename(Hamilton_CT.t, TRACT = id)
```

Rejoin the data:
```{r}
Hamilton_CT.t <- left_join(Hamilton_CT.t, Hamilton_CT@data, by = "TRACT")
```

#Activity

1. Create choropleth maps for the proportion of the population who are 20 to 34 years old, 35 to 49 years old, 50 to 65 years old, and 65 and older. 

```{r}
ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop20to34, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 20 to 34")

```

```{r}
ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop35to49, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 35 to 49")

```

```{r}
ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop50to64, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 50 to 64")

```

```{r}
ggplot() + 
  geom_polygon(data = Hamilton_CT.t, aes(x = long, y = lat, group = group, fill = cut_number(Prop65Plus, 5)), color = "white") + 
  coord_fixed() +
  scale_fill_brewer(palette = "YlOrRd") +
  labs(fill = "Prop Age 65+")

```

2. Show your maps to a fellow student. What patterns do you notice in the distribution of age in Hamilton?

3. Devise a rule to decide whether the pattern observed in a choropleth map is random.

# Bonus

Create cartograms of the same variables above. Since the cartograms are created based on the `SpatialPolygonsDataFrame`:
```{r}
Prop20to34.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Prop35to49.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Prop50to64.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
Prop65Plus.cartogram <- cartogram(shp = Hamilton_CT, weight = "POPULATION")
```

Tidy and restore the data:
```{r}
Prop20to34.cartogram.t <- tidy(Prop20to34.cartogram, region = "TRACT")
Prop20to34.cartogram.t <- rename(Prop20to34.cartogram.t, TRACT = id)
Prop20to34.cartogram.t <- left_join(Prop20to34.cartogram.t, select(Prop20to34.cartogram@data, TRACT, Prop20to34), by = "TRACT")
Prop20to34.cartogram.t <- rename(Prop20to34.cartogram.t, Proportion = Prop20to34)

Prop35to49.cartogram.t <- tidy(Prop35to49.cartogram, region = "TRACT")
Prop35to49.cartogram.t <- rename(Prop35to49.cartogram.t, TRACT = id)
Prop35to49.cartogram.t <- left_join(Prop35to49.cartogram.t, select(Prop35to49.cartogram@data, TRACT, Prop35to49), by = "TRACT")
Prop35to49.cartogram.t <- rename(Prop35to49.cartogram.t, Proportion = Prop35to49)

Prop50to64.cartogram.t <- tidy(Prop50to64.cartogram, region = "TRACT")
Prop50to64.cartogram.t <- rename(Prop50to64.cartogram.t, TRACT = id)
Prop50to64.cartogram.t <- left_join(Prop50to64.cartogram.t, select(Prop50to64.cartogram@data, TRACT, Prop50to64), by = "TRACT")
Prop50to64.cartogram.t <- rename(Prop50to64.cartogram.t, Proportion = Prop50to64)

Prop65Plus.cartogram.t <- tidy(Prop65Plus.cartogram, region = "TRACT")
Prop65Plus.cartogram.t <- rename(Prop65Plus.cartogram.t, TRACT = id)
Prop65Plus.cartogram.t <- left_join(Prop65Plus.cartogram.t, select(Prop65Plus.cartogram@data, TRACT, Prop65Plus), by = "TRACT")
Prop65Plus.cartogram.t <- rename(Prop65Plus.cartogram.t, Proportion = Prop65Plus)
```

Stack all cartograms into a single dataframe, add factor for frames:
```{r}
Cartograms <- rbind(data.frame(Prop20to34.cartogram.t, factor = "Age 20 to 34"),
                    data.frame(Prop35to49.cartogram.t, factor = "Age 35 to 49"),
                    data.frame(Prop50to64.cartogram.t, factor = "Age 50 to 64"),
                    data.frame(Prop65Plus.cartogram.t, factor = "Age 65+"))
```

Create plot with frames:
```{r}
Cartograms.plot <- ggplot(Cartograms, aes(x= long, y = lat, group = group, fill = cut_number(Proportion, 5))) + 
  geom_polygon(aes(frame = factor), color = "white") +
  scale_fill_brewer(palette = "YlOrRd") +
  coord_fixed() +
  theme(legend.position = "bottom") +
  labs(fill = "Proportion")
```

Plot:
```{r}
ggplotly(Cartograms.plot)
```

Try rectangular cartograms:
```{r}
library(recmap)
```

Rectangular algorithms require the centroids of the zones, which can be obtained by means of `sp::coordinates`
```{r}
xy_centroids <- coordinates(Hamilton_CT)
```

Create a frame for the rectangular cartograms:
```{r}
rec_frame <- data.frame(x = xy_centroids[,1], 
           y = xy_centroids[,2],
           dx = sqrt(Hamilton_CT$AREA) / 2 / (0.7 * 60 * cos(xy_centroids[,2] * pi / 180)),
           dy = sqrt(Hamilton_CT$AREA) / 2 / (0.7 * 60)) 
```

Generate the input for the `recmap` function. This is the frame for the cartograms `rec_frame` and a variable z:
```{r}
POP.rec <- data.frame(rec_frame, z = Hamilton_CT$POPULATION + 1, name = Hamilton_CT$TRACT)
head(POP.rec)
```

```{r}
POP.rec.cartogram <- recmap(POP.rec)
```

```{r}
plot(POP.rec.cartogram)
```

```{r}
ggplot(data = POP.rec.cartogram) + geom_rect(aes(xmin = x - dx, xmax = x + dx, ymin = y - dy, ymax = y + dy)) + coord_fixed()
```

```{r}
Cartograms.rec <- rbind(data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop20to34,
                                   factor = "Age 20 to 34"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop35to49,
                                   factor = "Age 35 to 49"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop50to64,
                                   factor = "Age 50 to 64"),
                        data.frame(POP.rec.cartogram,
                                   Proportion = Hamilton_CT$Prop65Plus,
                                   factor = "Age 65+"))
```

Create plot with frames:
```{r}
Cartograms.rec.plot <- ggplot(Cartograms.rec, aes(xmin = x - dx, xmax = x + dx, ymin = y - dy, ymax = y + dy, fill = cut_number(Proportion, 5))) + 
  geom_rect(aes(frame = factor)) +
  scale_fill_brewer(palette = "YlOrRd") +
  coord_fixed() +
  theme(legend.position = "bottom") +
  labs(fill = "Proportion")
```

Plot:
```{r}
plot <- Cartograms.rec.plot %>% animation_opts(frame = 1000, transition = 1000, easing = "elastic")
ggplotly(plot)
```